Relaxations for inference in restricted Boltzmann machines

نویسندگان

  • Sida I. Wang
  • Roy Frostig
  • Percy Liang
  • Christopher D. Manning
چکیده

We propose a randomized relax-and-round inference algorithm that samples near-MAP configurations of a binary pairwise Markov random field. We experiment on MAP inference tasks in several restricted Boltzmann machines. We also use our underlying sampler to estimate the log-partition function of restricted Boltzmann machines and compare against other sampling-based methods. 1. Background and setup A binary pairwise Markov random field (MRF) over n variables x ∈ {0, 1} models a probability distribution pÃ(x) ∝ exp(xÃx). The non-diagonal entries of the matrix à ∈ Rn×n encode pairwise potentials between variables while its diagonal entries encode unary potentials. The exponentiated linear term xÃx is the negative energy or simply the score of the MRF. A restricted Boltzmann machine (RBM) is a particular MRF whose variables are split into two classes, visible and hidden, and in which intra-class pairwise potentials are disallowed. Notation We write Symn for the set of symmetric n× n real matrices, and S to denote the unit sphere {x ∈ R : ‖x‖2 = 1}. All vectors are columns unless stated otherwise. 1.1. Integer quadratic programming Finding the maximum a posteriori (MAP) value of a discrete pairwise MRF can be cast as an integer quadratic program (IQP) given by max x∈{−1,1}n xAx (1) International Conference on Learning Representations, Banff, Canada, 2014. ∗Authors contributed equally. Note that we have the domain constraint x ∈ {−1, 1} rather than {0, 1}. We relate the two in Section 2.3.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Deep Learning using Restricted Boltzmann Machines

Restricted Boltzmann machines (RBM) are probabilistic graphical models which are represented as stochastic neural networks. Increase in computational capacity and development of faster learning algorithms, led RBMs to become more useful for many machine learning problems. RBMs are the building blocks of many deep multilayer architectures like Deep Belief networks (DBN) and Deep Boltzmann Machin...

متن کامل

Discrete restricted Boltzmann machines

We describe discrete restricted Boltzmann machines: probabilistic graphical models with bipartite interactions between visible and hidden discrete variables. Examples are binary restricted Boltzmann machines and discrete näıve Bayes models. We detail the inference functions and distributed representations arising in these models in terms of configurations of projected products of simplices and ...

متن کامل

Elastic regularization in restricted Boltzmann machines: Dealing with $p\gg N$

Restricted Boltzmann machines (RBMs) are endowed with the universal power of modeling (binary) joint distributions. Meanwhile, as a result of their confining network structure, training RBMs confronts less difficulties (compared with more complicated models, e.g., Boltzmann machines) when dealing with approximation and inference issues. However, in certain computational biology scenarios, such ...

متن کامل

Cumulative Restricted Boltzmann Machines for Ordinal Matrix Data Analysis

Ordinal data is omnipresent in almost all multiuser-generated feedback questionnaires, preferences etc. This paper investigates modelling of ordinal data with Gaussian restricted Boltzmann machines (RBMs). In particular, we present the model architecture, learning and inference procedures for both vector-variate and matrix-variate ordinal data. We show that our model is able to capture latent o...

متن کامل

Phase transitions in Restricted Boltzmann Machines with generic priors

We study generalized restricted Boltzmann machines with generic priors for units and weights, interpolating between Boolean and Gaussian variables. We present a complete analysis of the replica symmetric phase diagram of these systems, which can be regarded as generalized Hopfield models. We underline the role of the retrieval phase for both inference and learning processes and we show that ret...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • CoRR

دوره abs/1312.6205  شماره 

صفحات  -

تاریخ انتشار 2013